Rank in Wordlist | Frequency | Word |
---|---|---|
1882 | 129 | 2,30 |
2316 | 101 | 1,5 |
3270 | 70 | 0,5 |
3272 | 70 | 2,5 |
3407 | 67 | 1,2 |
3721 | 60 | 1,4 |
3784 | 59 | 0,1 |
4017 | 55 | 1,7 |
4353 | 50 | 1,3 |
4495 | 48 | 0,2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
5408 | 39 | resposta(s |
9443 | 18 | Conselheira(o |
38863 | 2 | GAMES(WITH |
40831 | 2 | ROSA(DO |
49348 | 2 | um(a |
51946 | 1 | 2004(Mesquita |
54957 | 1 | A(H1N1 |
55143 | 1 | ANASA(anase |
55473 | 1 | Acção/Aventura(M12 |
56012 | 1 | Almirante(na |
Rank in Wordlist | Frequency | Word |
---|---|---|
19283 | 6 | 0)http%3A%2F%2Fwww |
44280 | 2 | des)governo |
49836 | 1 | 0-0)e |
50077 | 1 | 1)http%3A%2F%2Fwww |
56494 | 1 | Ar(e)s |
56645 | 1 | Art(e)Caminhos |
57431 | 1 | Be3Al2(SiO3)6 |
58197 | 1 | C(F)oderam |
61659 | 1 | ESCS)e |
62348 | 1 | Estado)demitem-se |
Rank in Wordlist | Frequency | Word |
---|---|---|
1857 | 131 | Comentárioshttp%3A%2F%2Fwww |
14086 | 10 | Comentáriohttp%3A%2F%2Fwww |
19283 | 6 | 0)http%3A%2F%2Fwww |
49780 | 1 | 0,2%e |
49794 | 1 | 0,61%para |
50077 | 1 | 1)http%3A%2F%2Fwww |
50127 | 1 | 1,3%do |
50394 | 1 | 10%dos |
50395 | 1 | 10%o |
52374 | 1 | 25%e |
Rank in Wordlist | Frequency | Word |
---|---|---|
8499 | 21 | S&P |
11846 | 13 | I&D |
17956 | 7 | S&P 500 |
29537 | 3 | AT&T |
29830 | 3 | C&A |
31273 | 3 | R&B |
31407 | 3 | S&P500 |
37587 | 2 | C&W |
39060 | 2 | H&M |
39354 | 2 | J&B |
Rank in Wordlist | Frequency | Word |
---|---|---|
50076 | 1 | 1$/barril |
50640 | 1 | 110.000$00 |
51117 | 1 | 1500$00 |
51333 | 1 | 1700$00 |
51787 | 1 | 2.005$00 |
52814 | 1 | 320$00 |
53900 | 1 | 600$00 |
54822 | 1 | 934.258$00 |
66369 | 1 | Ke$ha |
Rank in Wordlist | Frequency | Word |
---|---|---|
40646 | 2 | Poor"s |
49638 | 1 | -"Rupofobia |
60496 | 1 | DEUS",estamos |
63410 | 1 | Franco"nao |
64204 | 1 | Gras"s |
68012 | 1 | Mas,"uma |
69526 | 1 | O"Rourke |
71175 | 1 | Potter"s |
74928 | 1 | Thriller"(1982 |
76025 | 1 | Visão"faz |
Rank in Wordlist | Frequency | Word |
---|---|---|
41158 | 2 | Samuel Eto'o |
41747 | 2 | Umaru Yar'Adua |
56091 | 1 | Amar'e Stoudemire |
61365 | 1 | Donal O'Kelly |
64033 | 1 | Giscard d'Estaing |
64559 | 1 | Harper's Bazaar |
66374 | 1 | Keb' Mo' |
66456 | 1 | King's College |
66610 | 1 | L'Osservatore Romano |
69295 | 1 | Newell's Old Boys |
Rank in Wordlist | Frequency | Word |
---|---|---|
30665 | 3 | L+arte |
41408 | 2 | Sumol+Compal |
53289 | 1 | 45+3 |
53449 | 1 | 5+5 |
53782 | 1 | 6+5 |
54755 | 1 | 90+2 |
58198 | 1 | C+S |
65066 | 1 | IRC+1,5 |
66917 | 1 | Leilões+Arte |
72315 | 1 | Retrato+Figura |
Rank in Wordlist | Frequency | Word |
---|---|---|
88138 | 1 | f*didos |
Rank in Wordlist | Frequency | Word |
---|---|---|
5789 | 35 | Trackbacks/Pingbacks |
6247 | 32 | e/ou |
9084 | 19 | Consultores/Dirigentes |
9438 | 18 | CDS/PP |
13115 | 11 | 2009/2010 |
13444 | 11 | c/quintal |
13445 | 11 | c/som |
15251 | 9 | PT/TVI |
16172 | 8 | 2/3 |
17500 | 7 | 2007/2008 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots